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Abstract 

Building on research utilizing class size and student achievement as an independent and a 
dependent variable, respectively, the author examined patterns of achievement among minority 
and white students in small (S), regular (R), and regular-with-aide (RA) class-size 
treatments over varying lengths of time and at varying start-up years, grades K, 1, 2, and 3. 
In a re-analysis of norm- (NRT) and criterion-referenced (CRT) achievement test data from 
Tennessee's Student Teacher Achievement Ratio (STAR) Project, the writer reports the 
following results: (1) Minority and white achievement for all years and class-sizes indicated 
that subsamples in (S) tended to obtain higher NRT and CRT mean scaled scores in reading and 
mathematics than either (R) or (RA) subsamples. Analyses consistently revealed a pattern of 
comparatively high 1 year mean scores with diminishing 2 years and 3 years scores followed by 
the greatest achievement in 4 years subsamples. The major difference between the minority and 
white pattern was the magnitude of the between-treatment subsample means: differences for 
white subsamples tended to be compressed relative to differences between minority subsamples, 
that is, class-size appeared to make a bigger difference for minority students than for white 
students. Differences between minority and white achievement for all years and class-sizes 
tended to be smallest for the 2 years and largest for the 3 years time in treatment. Achievement 
differences between minority (S) classes and white (R) classes for all years revealed the 
smallest gaps at grade K and grade 1, 2 years. Compared to white-minority differences 
displayed within any one class-size, the gaps that resulted from white (R) and minority (S) 
were smaller on every achievement measure and at every grade level. Achievement differences 
between white (R) and minority (R) and (S) cohorts that matriculated synchronistically with 
the four years of STAR class-size treatment were smallest after one year (CRT outcomes) or 
two years (NRT outcomes). 

The findings of this re-analysis support the following conclusions: (1) Small class-size 
may be an effective white-minority achievement gap reduction strategy in the primary school 
years; (2) Benefits accrue initially in greater measure to minorities than to whites; (3) 
Optimally beneficial treatment begins no later than grade 1 (preferably in K); (4) Optimally 
beneficial treatment lasts at least two years; (5) The major benefit of small-classes for 
minorities appears to be in preventing rather than remediating achievement disadvantagement; 
(6) The differential effect of small class-size for minorities appears to "fade" following two 
years treatment or in grade 3; and (7) As it applies uniquely to small class-size, the "fade" 
phenomenon of early treatments can be neither confirmed nor contradicted by this re-analysis. 
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(STAR) Project researchers C. M. Achilles of the University of North Carolina at Greensboro 
and B. Nye and D. Fulton of the Tennessee Center of Excellence for Research in Basic Skills, 
Tennessee State University, Nashville, TN. (Results expressed in this study do not necessarily 
reflect the position of the Center or its staff.) Thanks also to personnel in the Knox County, TN 
schools for confirming grade retention within a portion of the STAR data. 
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Class Size as an Early Intervention Strategy in 
White-Minority Achievement Gap Reduction 



The Problem 

One of the greatest challenges facing U. S. education policymakers and practitioners today 
is reducing differences in achievement among students of different ethnic groups. Despite 
research suggesting that schools can educate students successfully without regard to students' 
categorical membership (e.g., Edmonds, 1979; Lezotte, 1989), many schools fail to do so. 
Specifically, students of color and poverty tend to achieve less than their white and 
economically advantaged peers. Forty years after Brown vs. Topeka and nearly 30 years after 
the Elementary ahd Secondary Education Act of 1965 pledged our national commitment to equity 
and excellence in education, the white-minority gap is still unacceptably large and troubling 
(Jennings, 1992; Levin, 1990; Mullis, Owens, & Phillips, 1990). 

The size and nature of the achievement gap depend on a combination of the indicator, the 
location, and the minority group in question. Data compiled by the Sandia National Laboratory 
(1991) are illuminative: The Scholastic Aptitude Test (SAT) 1991 national mean score for 
Asian-Americans, for example, was actually higher than that of their white classmates. 
Similarly, retention and college attendance data indicated that there is, in fact, no achievement 
gap with respect to the Asian r American minority student. Black and Hispanic student data, on 
the other hand, demonstrated wide disparities. Using again, the 1991 SAT as an indicator, the 
mean score for black students was approximately 200 points below the white student mean 
while Hispanic students experienced, on average, a 130 point disadvantage. Similarly, black 
and Hispanic minorities were shown to drop out of school at proportionately higher rates than 
their white classmates. Tht .\\a indicated that nearly 80% of white students complete high 
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school "on time/ and about 88% do so by age 25. Alternately, only 70% of black students and 
50% of Hispanics graduate "on time." By age 25 t the graduation rate for blacks has climbed to 
about 82% (still 6% lower than whites), while only 60% of Hispanics have acquired their 
diplomas. When location is factored in, however, the data showed that the major gaps in white- 
minority achievement were occuring in the large urban school districts and could be as readily 
explained as a function of poverty as of ethnicity. 

Other databases have shown equally severe educational achievement disadvantages of 
minority children. The Southern Regional Education Board (SREB, 1986) reported that 
minority students are often "two to three or more grade levels behind on achievement measures. 
On National Assessment Tests, the reading achievement levels of Black eleventh grade students 
is basically the same as for White seventh graders (p. iv)." Results of the National Assessment 
of Educational Progress (NAEP) reading and mathematics tests for students grades 4, 8, and 12 
(MuIIis, Owens, & Phillips, 1990) demonstrated large white-minority differences in mean 
scores at every grade level and on both tests. A preliminary report of the 1992 NAEP 
assessment stated: "Although average performance for white students increased at each grade 
level . . . achievement for black and Hispanic students increased only at grade 12, and the gaps 
between whites' and blacks' performance remained substantial (Rothman, 1993a, p. 23)/ 
Education Week later reported: "Wide Racial Gap Found on Open-Ended Math Items (Rothman, 
1993b, p. 18);" and "Students' Reading Skills Fall Short, NAEP Data Find Achievement Gap for 
Blacks Remains Wide (Viadero, 1993, p. 1). H 

Differences abound, too, in terms of placement in special educational programs 
(Hathaway, 1990, p. 224): black students are approximately three times as likely to be in a 
class for the educable mentally retarded but only half as likely to be in a class for the gifted and 
talented; only about one-third of the estimated 2.7 million limited English speaking proficient 



(LEP) students aged 5 to 14 receive special help commensurate with their linguistic needs; 
black and Hispanic students are two to three times as likely to be suspended or expelled and only 
half as likely to be enrolled in courses that lead to college preparation. Consequently, black and 
Hispanic students continue to be significantly underrepresented in university degree programs 
(Marks, 1985; Sandia National Laboratories, 1991). 

Black males appear to fare particularly poorly vis-a-vis their white classmates. Likened 
to an "endangered species" (Wright, 1992), black men, who make up just six percent of the U. 
S. population, are now three percent of college student enrollment and 47% of America's prison 
population (Hodgkinson, 1992, p. 3). Since 82% of America's prisoners, each of whom costs 
about $22,500 annually, are high school dropouts (Hodgkinson, 1992, p. 3), one need not be a 
rocket scientist to conclude that if schools could but improve the achievement of the black male 
subgroup alone, the savings to society may be profound. 

In summary, whether one considers scores on norm-referenced standardized tests (NRTs) 
of achievement, dropout data, placement in special educational programs, or post-secondary 
school status, minorities lag behind their white counterparts in educational achievement. 
Accordingly, education policymakers must seek a reasonable option for reducing this disparity 
that is at once widely available, cost efficient, easily implemented and operational as a public 
school option. Because evidence suggests that without intervention, the achievement gap for any 
one minority student may increase dramatically over the course of the school career (Haycock, 
1990), education researchers are challenged to investigate early intervention strategies which 
promise to reduce the gap. Among the fruits of such an investigation might be an enhanced 
understanding of all variables involved, including the comparative effects of duration and 
incipience of treatment (x) on student achievement. Simply stated, policy must be guided by 
research that answers the question: If x, how much and when? 



Purpose and Significance of the Study 
Results of the Student/Teacher Achievement Ratio (STAR) Project (Word, Johnson, Pate- 
ABain, Fulton, Zaharias, Achilles, Lintz, Folger, & Breda, 1990) and its progeny, the Lasting 
Benefits Study (LBS) (Nye, Achilles, Zaharias, Fulton, & Wallenhorst, 1992a) and Project 
CHALLENGE (Nye at aL 1992b), suggested that small class-size in the primary grades (K-3) 
may be effective in closing the white-minority achievement gap. Funded by the Tennessee 
legislature, STAR was a large-scale experiment in which kindergarten students and teachers 
were randomly assigned to small and large classes within each participating school. Students 
remained in these class-types for four years. At the end of each year, they were measured in 
reading and mathematics by standardized norm- and curriculum (criterion)- based tests. The 
results indicated that (a) a significant benefit accrued to students in reduced-size classes in 
both subject areas and (b) there was evidence to suggest that minority students in small classes 
outperformed their peers in kindergarten classes of regular size and also gained more in reading 
outcomes the second and subsequent years (Finn & Achilles, i990). Summarizing the pertinent 
findings, Finn and Achilles asserted: 

In addition to an overall class-size effect, there is strong indication that the performance of 
minority students is enhanced in th.9 small-class netting. This important outcome is 
statistically confirmed only in inner-city and suburban areas, but the same trend is seen in 
urban and rural schools as well. Also, minority students in the longitudinal subsample 
experienced greater relative growth than white students in the second year of small-class 
participation (p. 574). 

Despite the minority student achievement benefits attributed to small class-size, however, 
the variables of (a) duration of (i.e., time spent in) "treatment" and (b) incipience of (i.e., 
year of fire* exposure to) "treatment" in a controlled experimental study remained 
unnaccounted for. (Neither LBS nor Project CHALLENGE are experiments.) Essentially missing 
from the knowledge base, then, was a quantative description of student achievement as a function 



of length of time in and school year(s) of exposure to class-size treatments. Accordingly, the 
primary purpose of this study was to examine patterns of achievement among white and 
minority students in small (S), regular (R), and regular-wi:h-aide (RA) class-size 
treatments over varying lengths of time and at varying start-up yeans, grades K, 1, 2, and 3. 
The researcher was guided by STAR, LBS, and CHALLENGE findings (Achilles, Nye, Zaharias, 
Fulton, & Bingham, 1993) which suggested that (a) earlier intervention may be more effective 
than later intervention, (b) the benefits of intervention may be cumulative, and (c) small 
class-size treatment may be more effective in preventing than in remediating the white- 
minority achievement gap. In addition to enhancing education researchers' understanding of the 
interaction of ethnicity and the duration and incipience of class-size "treatment" with student . 
achievement, the findings suggest to education policymakers guidelines for effective 
implementation of small class-size as a cost efficient public school option. 

Methodology 

Commensurate with decision-oriented educational research and the use of existing databases 
(Cooley & Bickle, 1986; Stakenas, 1989), the present study was a re-analysis of data collected 
. in the Student Teacher Achievement Ratio (STAR) Project (Word et al., 1990). Accordingly, 
the (1) STAR subjects, (2) STAR instruments and procedures, and (3) procedures employed in 
the present re-analysis will be discussed. 
STAR Subjects 

The Project STAR design provided for four years of randomly assigned small (S), 
regular (R), and regular-with-aide (RA) classroom "treatments" for a single cohort of 
approximately 7,000 student subjects who began Tennessee public school kindergarten in 1985 
and who completed third grade in 1989. Ideally, STAR researchers intended for students 
assigned to one of the three class types as kindergarteners to persist in that class type all four 
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years; however, as new students moved into the attendance zone of an experimental school, they, 
too, were randomly assigned to one of the three conditions. Similarly, STAR researchers 
encountered subject attrition through the moving away of students from experimental schools 
and classrooms. Consequently, not ail STAR students received four years of treatment. By the 
final year of the project, about one-third of the students had been in the same class type all four 
years; the remaining two-thirds were replacements and additions. 
STAR Instruments and Procedures 

Although STAR researchers collected data using instruments specifically selected for Project 
STAR (e.g., the Self-Concept and Motivation Inventory), the primary student achievement 
outcome measures were obtained from instruments mandated by the the TN Department of 
Education Testing Program. Both norm-referenced tests (NRT's) and criterion-referenced 
tests (CRTs) were included. The NRT's were nationally-normed, published instruments, while 
the CRTs were developed by the TN Department of Education. Commensurate with the state 
testing program, the instruments and testing procedures employed were a function of whether 
the sample was in kindergarten or in grades 1- 3. 

Kindergarten . In the kindergarten sample, only a NRT, the Stanford Early School 
Achievement Test (SESAT) Form 2 (The Psychological Corporation, 1985), was administered. 
Developed as a downward extension of the Stanford Achievement Test (SAT) Series, the SESAT is 
a group test intended to measure school achievement in grades K.5-1.5. Scores were obtained 
in (1) Sounds and Letters, (2) Word fteading, (3) Total Reading, (4) Mathematics, (5) 
Listening to Words and Stories, (6) Total for Basic Battery, (7) Environment, and (8) Total 
for Complete Battery. Although validity coefficients (Person Product-Moment) between the 
SESAT and the Otis-Lennon School Ability Test (.81 for the complete battery) and reliability 
coefficients (Kuder-Richardson 20) between the SESAT and the SAT Primary 1 instrument 



(.45 to .52 on specific subscales) are modest, Ackerman (1989) suggests, M At a level of 
aggregation that considers intact classes as a whole in comparison to state and national norms, 
the test may provide useful information (p. 866)." For the purposes of this study, only the 
Total Reading and Mathematics scores were analyzed. According to the Technical Data Report 
(The Psychological Corporation, 1985) compiled by the test publisher, KR-20 r's were .93 
and .81 for the total reading and total math respectively (p. 32). The SESAT was administered 
during the spring of the kindergarten year under controlled testing conditions including the use 
of trained substitute teachers as proctors. 

Grades 1-3 . Beginning in gradel, the Stanford Achievement Test (SAT) Primary 1 was 
employed as the NRT instrument. The SAT is a nationally normed group test intended to measure 
"the important learning outcomes of the school curriculum (The Psychological Corporation, 
1985)." Scores for the Primary 1 form (administered in grades 1 and 2) were obtained in (1) 
Word Study Skills, (2) Word Reading, (3) Reading Comprehension, (4) Total Reading, (5) 
Concepts of Number, (6) Mathematics Computation, (7) Mathematics Applications, (8) Total 
Mathematics, (9) Language, (10) Spelling, (11) Environment, (12) Listening, (13) Basic 
Battery, and (14) Complete Battery. Primary 2 form, administered in grade 3, is identical to 
Primary 1 except for substituting "Reading Vocabulary" for "Word Reading." Kuder- 
Richardson (KR) 20 reliability coefficients have been obtained for each test and subtest for 
each form and level. These range from .85 to over .90. Alternative form reliability 
coefficients tend to be slightly lower than the corresponding KR-20 coefficients, but almost all 
are .80 or higher (Carpenter, 1989). The present study utilized only the Total Reading and 
Total Mathematics subscores as student achievement NRT outcome measures. 

Beginning in grade 1, the TN Department of Education Testing Program provided for the use 
of state-developed curriculum-referenced (criterion-referenced) tests locally known as Basic 
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Skills First (BSF) tests (TN Department of Education, 1987). These tests were created from 
well-specified lists of objectives in reading and mathematics at each grade level, and can be 
scored either as the totaf number of items answered correctly, or as pass-fail. A student 
passes if he/she masters 80% of the objectives covered by the test items. The present study 
used the pass-fail method in the reported analyses. No reliablity data are available for the BSF 
tests. The SAT and BSF tests were administered during the spring of the grades 1, 2, and 3 
years under controlled conditions. 
Procedures Employed in the Present Re-Analvsis 

To examine the absolute and comparative achievement patterns of minority versus white 
students in small (S), regular (R), and regular-with-aide (RA) class-sizes as a function of 
time in "treatment" and beginning schoolyear of treatment required: (1) identifying the 
appropriate subsarnple of students in each grade and class-size and for each achievement 
measure; (2) calculating the minority and white subsamples 1 mean scaled scores and standard 
deviations for each achievement measure by length of time in each class-size treatment at each 
grade; (3) tabulating the minority and white subsamples 1 mean scaled scores for each 
achievement measure by length of time in each class-size treatment at each grade; (4) 
tabulating a white-minority achievement gap as shown by differences in mean scaled scores of 
each achievement measure by grade by length of time in each class-size treatment; (5) 
tabulating achievement differences between minority small-class and white regular-class 
subsamples by grade by length of time in class; (6) calculating "effect sizes" to measure the 
"educational importance" of selected mean score achievement differences between subsamples; 
and (7) charting histograms to compare the achievement of white (R) and minority (R) and (S) 
class-sizes. Regarding procedure (6), Glass and Hopkins (1984, p. 54) have explained that 
"educational importance" or "effect size" (ES) can be mathematically represented as the 
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quotient of the difference (gain or loss) between any two means and the standard deviation of 
their combined distributions. It is expressed as a proportion of 1.00 sigma. 

Results 

Minority Student Achievement 

Table 1 shows the results for (S) exposure. Except for the 4 years subsample, both 
SAT (the NRT component) mean scaled scores of minority subsamples which spent 1 year in (S) 
treatment exceed that of subsamples exposed 2 years or 3 years. Similarly, the BSF (the CRT 
component) measures of (S) minority students are, except for the subsample that began in K 
and continued through grade 3, greater for each 1 year subsample than for corresponding 
multiple year subsamples. In fact, except for the subsample that spent 4 years in (S), 
increased time in treatment appears to be associated with lower achievement. On average, both 
SAT and BSF mean differences between 1 year, 2 years, and 3 years exposed subsamples were 
moderately small with effect sizes (ES) < .20 sigma. At 18.73 points (.49 sigma), the point of 
maximal difference was the SAT Math mean scores between the 3 years and 4 years subsamples. 
(Subsample n's and SD's are reported in Appendix A and Appendix B, respectively.) 

Table 2 shows the results for minorities exposed to (R). With one exception (BSF Math, 2 
years), the same pattern of achievement attained by (S) subsamples was shown by (R) 
subsamples. Although displaying a wide range of differences across any one grade, the 
achievement for 1 year subsamples tended to be greater than that of multiple year students. At 
16.88 points (.43 sigma), the point of maximal difference was again the SAT Math mean scores 
between the 3 years and 4 years subsamples. 

Table 3 shows the results for minorities exposed to (RA). As in the (S) and (R) 
subsamples, 1 year achievement measures uniformly exceeded 2 years measures; however, 3 
years subsample mean scores were greater than 1 year or 2 years measures while 4 years 
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Table 1 



Minority Stanford Achievement Test (SAT) and Basic Skills First (BSF) Mean Scaled 
Scores by Grade by Length of Time in Small Class-Size 



Achievement 
Measures 

SAT Reading 



BSF Reading 



SAT Math 



BSF Math 



Grade 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 



1 year 
439.12 
510.18 
571 .51 
607.79 

27.22 
38.64 
31.17 
479.56 
521 .49 
568 91 
608.79 

38.96 
51.90 
48.00 



Length of Time in Small Class-size 
2 years 3 years 4 years 



495.35 
569.18 
606.19 

24.83 
37.16 
30.71 

503.40 
566.93 
604.39 

35.86 
51.90 
47.62 



561 .37 
594.55 



35.17 
28.03 



556.46 
594.90 



48.49 
44.84 



61 2.78 



32.11 



613.63 



48.85 



1 0 
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Table 2 



Minority Stanford Achievement Test (SAT) and Basic Skills First (BSF) Mean Scaled 
Scores by Grade by Length of Time in Regular Class-Size 



Achievement 
Measures 

SAT Reading 



BSF Reading 



SAT Math 



BSF Math 



Grade 


1 year 


O \/o 0 rc 
ycdlo 


r\ 






1 


A Q 1 Q£ 




o 
c. 


OOO . 0 c. 


O D . f 0 


3 


595.81 


590.1 8 


K 






1 
1 


24.35 


20.41 


2 


35.84 


31.67 


3 


30.01 


29.24 


K 


472.53 




1 


508.39 


484.98 


2 


559.20 


542.68 


3 


600.62 


599.73 


K 






1 


36.45 


31.92 


2 


49.67 


46.71 


3 


46.08 


45.07 



Length of Time in Regular Class-size 

3 years 4 years 



555.95 
592.23 



35.84 
29.04 



558.21 
593.00 



48.56 
44.78 



607.89 



31.87 



609.88 



48.24 



ERIC 
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Table 3 



Minority Stanford Achievement Test (SAT) and Basic Skills First (BSF) Mean Scaled 
Scores by Grade by Length of Time in Regular/Aide Class-Size 



Achievement 
Measures 



SAT Reading 



BSF Reading 



SAT Math 



BSF Math 



Length of Time in Regular/Aide Class-size 



Grade 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 



I year 

428.61 

493.18 

559.72 

598.26 

24.83 
35.81 
30.00 
469.64 
509.61 
557.51 
598.98 

36.81 
49.91 
46.21 



2 years 

475.33 
542.57 
594.57 

21.93 
30.74 
29.84 

494.58 
531 .85 
595.41 

33.13 
44.42 
45.75 



3 years 



556.80 
600.46 



36.28 
29.97 



557.15 
604.80 



49.50 
47.28 



4 years 



605.18 



31.16 



605.51 



47.46 



1 2 



means exceeded 3 years means. After 1 year, then, there appears to be a positive linear 
relationship between length of time in (RA) and achievement. The average ES between the 
lowest SAT scores in the 2 years subsamples and the highest scores in the 4 years subsamples 
was about .32 sigma. Both BSF outcomes in the (RA) 3 years exposure at grade 2 subsample 
exceeded that of students who had been similarly exposed at grade 3 for all 4 years by about 
5.00 percentage points (.67 sigma). 
White Student Achievement 

Table 4 shows the results of white student subsamples exposed to (S) treatment. For both 
SAT and BSF measures, 1 year subsamples uniformly achieved greater than 2 years and 3 years 
subsamples. White students who began (S) treatment in K and who continued all 4 years, 
however, showed the highest levels of achievement. The ES differences between the 4 years 
subsamples and the 1 year subsamples (about .16 sigma on the SAT measures and .08 sigma on 
the BSF measures) were small. The differences between the 3 years and 4 years subsamples, 
however, increase to .38 sigma and .30 sigma for the SAT and BSF respectively. Obvious in its 
singularity, the grade 2, 3 years subsample outscored the grade 3, 4 years subsample by 3.67 
percentage points (.63 sigma) on the BSF reading measure. 

Table 5 shows the results of white student subsamples exposed to (R). The pattern of 
relatively high 1 year mean scores with diminishing 2 years and 3 years scores, followed by 
the greatest achievement from 4 years subsamples, recurs in these data. Achievement ES 
differences between highest and lowest mean scores average about .43 on the SAT. Student 
subsamples in grade 2, regardless of length of time spent in (R), outscore student subsamples 
exposed 4 years by an average of more than 4.00 percentage points (.67 sigma) on BSF 
measures. The one exception is at grade 2. 1 year. 
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Table 4 



White Stanford Achievement Test (SAT) and Basic Skills First (BSF) Mean Scaled 
Scores by Grade by Length of Time in Small Class-Size 



Achievement 










Measures 






Length of Time in Small Class-size 




Grade 


1 year 


2 years 


3 years 4 years 


SAT Reading 


K 


443.70 






1 


539.49 


513.31 






2 


599.95 


570.18 


578.14 




3 


627.15 


619.10 


615.07 633.43 


BSF Reading 


K 








1 


27.81 


24.55 






2 


41.49 


35.42 


38.30 




3 


34.04 


33.58 


33.33 34.63 


SAT Math 


K 


496.20 






I 


546.67 


523.72 






2 


595.46 


576.87 


575.46 




3 


629.25 


621 .32 


620.35 635.12 


BSF Math 


K 










1 


40.45 


36.34 






2 


54.15 


51.45 


51.15 




3 


53.18 


52.55 


51.86 54.01 



1 4 

17 



Table 5 



White Stanford Achievement Test (SAT) and Basic Skills First (BSF) Mean Scaled 
Scores by Grade by Length of Time in Regular Class-Size 



Achievement 










Measures 






Length of Time in Regular Class-size 




Grade 


1 year 


2 years 


3 years 4 years 


SAT Reading 


K 


439.07 






1 


527.31 


501 .43 






2 


591 .97 


579.68 


578.28 




3 


621 .59 


Ci Q Q Q 

01 o.yts 


ft 1 O ft 1 C07 K1 


BSF Reading 


K 








1 


26.78 


24.16 






2 


40.64 


38.10 


38.66 




3 


33.35 


33.29 


32.09 34.06 


SAT Math 


K 


488.34 








1 


535.53 


514.83 






2 


588.63 


578.50 


572.95 




3 


624.26 


623.73 


615.92 630.43 


BSF Math 


K 










1 


39.78 


36.15 






2 


53.23 


50.57 


50.75 




3 


52.16 


52.26 


50.17 53.29 



1 5 

20 
0 



Table 6 shows the results of white student subsamples treated in (RA). The 1 year 
subsampies again demonstrate consistently higher levels of achievement than either 2 years or 
3 years subsamples. Achievement measures for 4 years subsamples, however, remain 
uniformly highest. The average ES difference between the lowest and highest SAT mean scaled 
scores is about .44 sigma. BSF Reading measures at grade 2, regardless of time spent in 
regular-with-aide class-size, are consistently higher than BSF measures on student 
subsamples exposed 4 years. 

Minority Compared with White Student Achievement 

Having examined the patterns of achievement for minority students and white students 
separately , the comparative achievement, or the achievement gap, of white and minority 
students as a function of class-size type, duration, and incipience of treatment As in the 
previous analyses, tabulations are made of each class-size for white and minority subsamples 
separately (Tables 7-9). However, to determine the possible effect of (S) in preventing the 
achievement gap, the researcher also cross-tabulated minority (S) and white (R) achievement 
differences (Table 10). To compare graphically the K, K+1, K+1 +2, and K+ 1+2+3 
achievement differences for white (R) and minority (R) and minority (S) subsamples, the 
researcher charted histograms, one for each achievement measure (Figures 1-4). 

Table 7 shows the white-minority achievement gap by grade and length of time in (S). 
With one exception (BSF Reading, grade 3, 4 years), the 2 years gaps are smallest and the 3 
years gaps largest. In grade 3, the second smallest gaps vascillate between 1 year and 4 years 
length of time spent in (S). The sizes of the gaps are large, ranging from 27.44 points between 
the grade 1,1 year and 2 years subsamples on the SAT Reading measure to 6.57 percentage 
points between the grade 2, 2 years and grade 3, 3 years subsamples on the BSF Math measure. 

1 6 



Table 6 



White Stanford Achievement Test (SAT) and Basic Skills First (BSF) Mean Scaled 
Scores by Grade by Length of Time in Regular/Aide Class-Size 



Achievement 
Measures 

SAT Reading 



BSF Reading 



SAT Math 



BSF Math 



Grade 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 



Length of Time in Regular/Aide Class-size 



1 year 
438.88 
533.38 
595.38 
621 .21 

27.03 
40.86 
33.17 
489.53 
537.95 
590.32 
624.10 

39.91 
53.71 
51.96 



2 years 

504.20 
580.77 
61 5.50 

23.68 
37.69 
32.28 

51 7.48 
575.62 
618.22 

36.95 
50.57 
51.45 



3 years 



582.56 
61 2.94 



38.81 
32.00 



584.91 
614.63 



51.80 
49.84 



4 years 



628.09 



34.14 



631 .91 



53.31 
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Table 7 



The White-Minority Achievement Gap: Differences in Stanford Achievement Test (SAT) 
and Basic Skills First (BSF) Mean Scaled Scores by Grade by Length of Time 

in Small Class-Size 



Achievement 
Measures 

SAT Reading 



BSF Reading 



SAT Math 



BSF Math 



Grade 


1 year 


2 years 


K 


9.91 




1 


29.31 


17.96 


2 


28.44 


1.00 


3 


19.36 


12.91 


K 






1 


0.59 


0.28 


2 


2.85 


1.74 


3 


2.87 


2.87 


K 


16.64 




1 


25.18 


20.32 


2 


26.55 


9.94 


3 


20.46 


16.93 


K 






1 


1.49 


0.48 


2 


2.25 


0.45 


3 


5.18 


4.93 



Length of Time in Small Class-size 

3 years 4 years 



16.77 
24.52 



3.13 
5.30 



19.00 
25.45 



2.66 
7.02 



20.65 



2.52 



21.49 



5.16 
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Table 8 shows the white-minority achievement gap by grade and length of time in (Ft). 
Except for the apparent moderate negative linear relationship between the achievement gap and 
length of time spent in (R) suggested by the grade 2 and grade 3 outcomes, no clear patterns 
seem to emerge. Even in the case of the latter, three out of the four achievement measures 
register a difference score gain after 2 years treatment. In short, the gap between both SAT and 
BSF measures seems to vary almost randomly. 

Table 9 shows the white-minority achievement gap obtained from (RA) subsamples. 
Clearly, the grade 3, 3 years length of time in (RA) subsamples mean differences were 
smallest. The average difference between the highest and lowest gap score for the SAT measures 
was more than 22.00 points, while the average percentage passing difference on the BSF was 
more than 2.00 points. For each measure, the grade 3, 4 years subsamples gap was either 
largest or next-to-largest. 

Table 10 shows minority (S) and white (R) achievement differences. On every 
achievement measure, the gaps are smallest at grade K (SAT Reading and Math) or grade 1, 2 
years (BSF Reading and Math). Except for SAT Reading, grade 2, 2 years and SAT Math, grade 2, 
1 year, the grade 3, 3 years gaps are largest. Uniformly, the grade 3, 4 years achievement 
gaps are smaller than the grade 3, 3 years gaps and, except for SAT Reading, smaller than the 2 
years gaps. The sizes of these gaps are consistent with the other analyses, ranging up to .16 
standard deviation and over 2.00 percentage points. Additional results may be obtained by 
comparing Table 10 with the other gap tables (Tables 7-9) showing the white-minority 
differences for any one class-size treatment. On every achievement measure and at every grade 
level, the obtained differences are smaller, sometimes by as much as 23.00 points. 
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Table 8 



The White-Minority Achievement Gap: Differences in Stanford Achievement Test (SAT) 
and Basic Skills First (BSF) Mean Scaled Scores by Grade by Length of Time 

in Regular Class-Size 



Achievement 
Measures 

SAT Reading 



BSF Reading 



SAT Math 



BSF Math 



Grade 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 

K 

1 

2 

3 



1 year 
11.10 
35.95 
33.15 
25.78 

2.43 
4.80 
3.34 
10.57 
27.14 
29.43 
23.64 

3.33 
3.56 
6.08 



Length of Time in Regular Class-size 
2 years 3 years 4 years 



33.20 
32.95 
28.80 

3.75 
6.43 
4.05 

29.85 
35.82 
24.00 

4.23 
3.86 
7.19 



22.33 
18.38 



2.82 
3.05 



14.74 
22.92 



2.19 
5.39 



19.62 



2.19 



20.55 



5.05 



Table 9 



The White-Minority Achievement Gap: Differences in Stanford Achievement Test (SAT) 
and Basic Skills First (BSF) Mean Scaled Scores by Grade by Length of Time 

in Reg/Aide Class-Size 



BSF Reading 



SAT Math 



BSF Math 



Achievement 

Measures Length of Time in Regular/Aide Class-size 

4 years 

SAT Reading 
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Grade 


1 year 


2 years 


3 years 


K 


10.27 






1 


40.20 


28.87 




2 


35.16 


38.20 


25.76 


3 


22.95 


20.93 


12.48 


K 








1 


2.20 


1.75 




2 


5.05 


6.95 


2.53 


3 


3.17 


2.44 


2.03 


K 


19.89 






1 


28.34 


22.90 




2 


32.81 


43.77 


27.76 


3 


25.12 


22.81 


9.83 


K 








1 


3.10 


3.82 




2 


3.80 


6.15 


2.30 


w 


5.75 


5.70 


2.56 



22.91 



2.98 



26.40 



5.85 



Table 10 



Minority Small-Class and White Regular-Class Subsample Differences in Stanford 
Achievement Test (SAT) and Basic Skills First (BSF) Mean Scaled Scores by 
Grade by Length of Time in Class 



Achievement 

Measures Length of Time in Class 

Grade 1 year 2 years 3 years 4 years 

SAT Reading K 5.35 

1 17.13 6.08 „ 



BSF Reading K 

1 0.44 0.67 



SAT Math K 8.46 

1 14.04 11.43 



BSF Math K 

1 0.82 0.29 



25 



2 20.46 10.50 16.91 

3 13.80 12.79 16.06 14.73 



2 2.00 0.94 3.49 

3 2.18 2.58 4.06 1.95 



2 19.72 11.57 16.49 

3 15.47 19.34 20.83 16.80 



2 1.33 1.33 2.26 

3 4.16 4.64 5.33 4.44 
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Figures 1-4 graphically illustrate the comparative achievement of white (R) and 
minority (R) and minority (S) for each achievement measure across one year (K), two years 
(K+1), three years (K+1+2), and four years (K+1+2+3). Figure 1 shows the results for 
SAT Reading. At each grade level combination, the gap between white (R) and minority (S) is 
narrower than that created by the two (Rs). The gap between white (R) and minority (S) 
achievement is narrowest at grades K and K+1 and grows increasingly larger in K+1+2 and 
K+1 +2+3. In the last two periods, minority (S) and minority (R) achievement is nearly 
identical. Overall year-to-year achievement gains, however, are substantially higher for all 
subsamples. 

Figure 2 shows the results for BSF Reading. (No BSF tests were administered in grade 
K.) In grade 1, the minority (S) subsample outscored bo>h white (R) and minority (R) 
subsamples. In subsequent grades, minority (S) achievement was virtually the same as 
minority (R) achievement, while white (R) achievement was comparatively and uniformly 
higher. For all subsamples, initial surges in achievement from 1 to 1+2 were followed by 
declines in 1+2+3. 

Figure 3 shows the results for SAT Math. This histogram is almost identical to that for 
SAT Reading (Figure 1). The greatest comparative differences occur in K+1. The difference in 
minority (R) and (S) is neglible for both K+1+2 and K+1 +2+3. 

Figure 4 shows the results for BSF Math. As in the case of the histograms for the SAT 
measures, the BSF histograms are remarkably similar. Initial differences (grade 1) are 
maximal. In subsequent periods, the white (R) subsamples achieve passing rates several 
percentage points higher than minority (R) or (S) subsamples. Differences between both 
minority subsamples are but a few hundreths of a percentage point. 
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Conclusions 

The purpose of this investigation was to examine patterns of achievement among white 
and minority students- in small (S), regular (R), and regular-with-aide (RA) class-size 
treatments over varying lengths of time and at varying start-up years, grades K, 1, 2, and 3. 
The investigator reasoned that "best" achievement gap reduction intervention would involve 
delivering treatment in the optimally effective quantity (length of time) and at the optimally 
effective time (start-up year). The investigator reasoned further that, to be a politically and 
educationally acceptable goal, small class-size (or any class-size) as a gap reduction 
intervention must be shown not only to benefit minorities, but not to harm whites. 
Consequently, the investigator examined absolute (within a group) and comparative (between 
the two groups) achievement for minorities and whites. To illustrate the effect of no 
intervention (considered in this context as R) vs. the effect of minority (S) as a gap reduction 
intervention strategy, the investigator extended the comparative examination to include 
different class-sizes within and between groups. 
The Length of Time Issue 

Relative to the length of time issue, the results showed that absolute achievement on 
NRT and CRT measures of both minorities and whites was optimal (highest) for students who had 
experienced all four years of small-class treatment. Moreover, minority between class-size 
differences were relatively larger than white between class-size differences, a fact which 
suggests that the effect of small classes on achievement was of greater benefit to minorities than 
to whites. This finding supports earlier STAR, LBS, and CHALLENGE research (Finn & Achilles, 
1990; Achilles et al., 1992). 

The comparative achievement analyses indicated, however, that the differential benefit 
for minorities did not seem to last. The white-minority achievement gap, smallest after two 
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years time in small-class treatment, appeared to "fade" in subsequent years. (The "fade" issue 
will be addressed in the next section of this paper.) The same finding was obtained in the 
minority small-class vs. white regular-class contrast; however, the resulting gaps were 
uniformly smaller on every measure and at every grade. The diminishing differential effect of 
small class-size for minorities was graphically apparent in the contrast of that group with 
white and minority regular-class students that participated in ali four years of STAR. 
The Start-Up Year Issue 

When one considers absolute achievement, the question of optimizing the benefit of small 
class-size by starting treatment in one or another school year highlights an intriguing finding 
of the present study: Except for the cohort that started in grade K and spent four years time in 
treatment, one year exposed students score higher than multiple year exposed students. The 
answer to this anomoly may, in part, be a function of the retention-promotion practices of STAR 
teachers and the scores of promoted vs. retained students. The percent of grade retentions in 
STAR was smallest in the (S) condition (7.8 vs. 12.6 in the R and 10.8 in the RA, grade 1) 
Moreover, students who were promoted tended to score lower (e.g., 422 in S vs. 427 in R on 
SAT Reading, K). (Percentages and scores are from Word et a!., p. 171.) The combination of 
fewer retainees and lower scores resulted in more "marginal 1 ' students matriculating on time 
with the cohort. Thus when tests were administered at any given grade, more academically 
capable and older students entering Project STAR for the first time outscored the less capable 
nonretained students in the original cohort. Over time, this phenomenon results in an apparent 
"fade" of early intervention benefits (Barnett, 1992). One year treatment means were higher 
than two and three year treatment means, however, not just for small-classes but for all class- 
sizes. A possible explanation is that increased attention was afforded STAR students (and 
teachers) regardless of class-size treatment. Accompanied by the possibility of higher 
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expectations of and for STAR participants, a Hawthorne effect could have been generated. 
(Additional analyses tend not to support this supposition. See Appendix C.) 

The comparative achievement pattern analyses examining white vs. minority differences 
for each class-size found that gap scores tended to be least for the two years in small-class 
group. Interestingly, this finding held true whether the start-up year was K, 1, or even 2; 
however, the smallest gaps were registered by the groups that began treatment in grade 1. 
When minority small-class achievement was compared to white regular-class achievement, the 
gaps were shown to be least at grade K and grade 1, two years treatment. 
What is the Optimal Time in Small Class-Size and When Should It Begin? 

Consistent with prior STAR, LBS, and CHALLENGE research, the findings of this re- 
analysis of the STAR database support the following conclusions: 

(1) Small class-size may be an effective white-minority achievement gap reduction 
strategy in the primary school years. 

(2) Benefits accrue initially in greater measure to minorities than to whites. 

(3) Optimally beneficial (defined as resulting in least white-minority achievement 
differences) treatment begins no later than grade 1 (preferably in K). 

(4) Optimally beneficial treatment lasts at least two years, grade 1 and K or 1 and 2. 

(5) The differential effect of small class-size for minorities appears to "fade" following 
two years treatment or in grade 3. 

(6) As it applies uniquely to small class-size, the "fade" phenomenon in early 
treatments can be neither confirmed nor contradicted by this re-analysis. To the extent that 
STAR small-class students were less likely to be retained and that retention in grade is an 
indicator that a student will drop out of school (Hahn, 1987), the nonretention of minority and 
white students in Project STAR broadens to life-long the optimization small-class treatment. 
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Implications for Policy and Practice 
First, although a cost-benefit analysis exceeds the scope of this study, the expense of 
implementing small classes in the primary years of school must be weighed against the cost of 
remediation (e.g., Chapter 1 reading programs) in later years, the effectiveness of which is 
questionable. As a report of the Perry School Project recently disclosed (Lewis, 1993), the 
benefits of early childhood intervention programs tend to yield lifelong dividends (e.g., higher 
levels of education, higher economic status, and greater social responsibility) for self and 
society. For preschool programs, the latest research suggests that $7.16 is the eventual 
savings to the public for every dollar invested. Although exact figures are impossible to 
estimate at this time, small class-size, too, may be a cost-efficient public school option. Absent 
contradictory evidence, small classes should be implemented in grades K and 1 whenever 
possible. 

Second, education policymakers and practitioners must act in ways that reflect 
sensitivity to the concerns of ethnic and racial minorities. One explanation for why small 
class-size may be of greater initial benefit to minorities than to whites is grounded in Cultural 
Deprivation Theory. According to Lansa and Potter (1984), underlying Cultural Deprivation is 
Piaget's theory that a child who, at an early age, is deprived of appropriate environmental 
stimuli lacks the experiences needed for the development of intellectual skills and abilities (p. 
4). To the extent that espousing Cultural Deprivation Theory may be viewed as the oppression 
of a minority culture by a majority one, it is ethically wrong. Gardner (1991, p. 53), for 
example, reports that the Kaluli people of New Guinea see babies as helpless creatures who 
neither understand nor are capable of speech. Rather than speaking to the children, Kaluli 
mothers speak n for them." Yet the family unit is strong with the generations cohabiting until 
death parts them. Despite the viability and grace of Kaluli practices, the tendency to label such 
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a culture "deprived" vis-a-vis our predominantly white, Anglo-Saxon, protestant, middle- 
class American culture is great. However, educators and education policymakers must not 
permit application of the model in everyday life to minimize or obliterate the good and moral 
qualities inherent in all cultures and peoples. 

Within the next 20 years, Americans will recognize that as the number of children 
declines as a percent of the U. S. population and people of color (non-Anglos) become the 
majority (see Hodgkinson, 1992), we literally cannot afford to allow the achievement of the 
"new majority" youngsters to lag behind their white counterparts. We cannot wait 20 years to 
do something about it. Preliminary research suggests that small class-size in the primary 
years of school may be an effective strategy for not only reducing the the white-minority 
achievement gap, but for enhancing the achievement of all students. 
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APPENDIX C: Tables of Additional Analyses of STAR Data Pertinent to Small Class as a 
"Gap Reduction" Strategy for Achievement Differences 

Minority Stanford Achievement Test (SAT) Mean Scaled Scores by Grade by 
Length of Time in Small Class-Size Obtained from "Unfiltered" Data Set 



Achievement 

Measures Length of Time in Small Class-size 

Grade 1 year 2 years 3 years 4 years 

SAT Reading K 433.79 

1 498.24 513.74 

2 565.83 565.66 575.28 

3 602.79 606.19 594.55 612.78 
SAT Math K 479.56 

1 51 1.88 524.42 

2 563.28 562.58 572.39 

3 610.89 604.39 594.90 613.63 
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White Stanford Achievement Test (SAT) Mean Scaled Scores by Grade by Length 
of Time in Small Class-Size Obtained from "Unfiltered" Data Set 



Achievement 










Measures 






Length of Time in Small Class-size 




Grade 


1 year 


2 years 


3 years 4 years 


SAT Reading 


K 


443.69 






1 


523.60 


542.73 






2 


587.66 


584.93 


604.67 




3 


611.12 


619.10 


619.07 633.43 


SAT Math 


K 


496.20 








1 


538.08 


548.42 






2 


582.68 


582.42 


599.83 




3 


615.19 


621 .32 


620.35 635.12 
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The White-Minority Achievement Gap: Differences in Stanford Achievement Test (SAT) 
Mean Scaled Scores by Grade by Length of Time in Small Class-Size 
Obtained from "Unfiltered" Data Set 



Achievement 

Measures Length of Time in Small Class-size 

Grade 1 year 2 years 3 years 4 years 

SAT Reading K 9.90 

1 25.36 28.99 

2 21.83 19.27 29.39 

3 8.33 12.91 24.52 20.65 
SAT Math K 16.64 

1 26.20 24.00 

2 19.40 19.84 27.44 

3 4.30 16.93 25.45 21.49 
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Minority Small-Class and White Regular-Class Subsample Differences in Stanford 
Achievement Test (SAT) Mean Scaled Scores by Grade by Length of Time in Class 

Obtained from "Unfiltered" Data Set 



Achievement 












Measures 






Length of ( ime in Class 






Grade 


1 year 


2 years 


3 years 


4 years 


SAT Reading 


K 


5.28 








1 


18.36 


18.96 








2 


21.17 


14.04 


42.22 






3 


16.50 


12.79 


16.06 


14.73 


SAT Math 


K 


8.78 










1 


16.92 


15.58 








2 


25.25 


16.66 


20.39 






3 


7.28 


19.34 


20.83 


16.80 
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